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Description 

MORE ECONOMICAL RESOURCE APPLICATION ON THE USER INTERACTION 
WITH A SPEECH DIALOGUE SYSTEM IN A PACKET NETWORK BY MEANS OF A 
5 SIMPLIFYING PROCESSING OF SIGNALLING INFORMATION 

The invention relates to methods and devices for a simplifying 
processing of signaling information during a dialogue with a 
speech dialogue system in a packet network. 

10 

One of the most important current developments affecting the 
fields of network technologies, of call processing, and of 
Internet technologies, is the realization of services with 
real-time transmission via packet networks. 

15 

At the present moment, most speech transmission is handled via 
line switched networks - also known as TDM (time division 
multiplexing) networks. The aim for the future is to transmit a 
greater amount of speech via packet oriented networks which are 

20 currently used mainly for data transmission. Hereby, the so 
called IP (Internet protocol) networks are the most important 
class of packet networks. In addition, in future there will be 
further transmission capacity intensive real-time services, 
such as, for example, the transmission of video data during a 

25 video -on- demand service. 

An important class of real-time services is the automated 
provision of speech or video information. One example of this 
type of service is given by the recorded announcement services 
30 known from TDM networks, e.g. telephone number announcements 
(,,the telephone number of the subscriber is ...") or error 
messages („the number you are trying to call is not 
available"). Thereby, automated information output can also 
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contain subscriber specific information (e.g. telephone 
numbers) . Dialogue f \inctionality is an extension of the 
announcement functionality. There the user can control the 
service or the dialogue by using the keys in his terminal 
5 device or by means of speech input. Servers are used to achieve 
such kinds of services in packet networks . In the case of 
interactive services, the term IVR (interactive voice response) 
server is commonly used. A nxomber of coding methods or codecs 
(coder-decoder), such as, for example, G.71lA/u, G. 723.1, 

10 G.726, G.728 and G.729A/B were standardized for the 

transmission of speech. Standards H.261 and H.263, for example, 
are used for the transmission of video information. For an 
information output, usually a codec or coding method that is 
supported by both ends of the network is selected for the 

15 information transmission in a so called codec negotiation. 

For services with real-time transmission via data networks, it 
is essential that the service characteristics known from the 
TDM network be provided for corresponding or new services with 
20 comparable quality and efficiency. The optimization of the 
resource application plays an important part in this . 

The task of the invention is to improve the efficiency of the 
resource application in the automated information output. 

25 

The invention is based on the following consideration. The 
signaling with relation to an interactive dialogue with a 
speech dialogue system or an IVR (Interactive Voice Response) 
server, e.g. for an output of information, is usually carried 
30 out using DTMF* signals (DTMF: Dual Tone Multiple Frequency). 
With this signaling - also frequently called tone dialing or 
dual tone frequency dialing - an interaction between the 
subscriber and the speech dialogue system is realized by means 
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of an exchange of coded information through frequencies. Three 
scenario's can be distinguished in the transmission of DMFT 
signals via a packet network: 

5 • The DMFT signals are contained in the payload stream r In this 
connection, one also talks about in-band transmission. In-band 
transmission is only used in^ conjunction with non-compressing 
coding methods or codecs such as, for example, G-711. 

10 • DMFT signals and payload are transmitted separately, i.e. 
out-of-band transmission of DTMF signaling information is 
carried out. 

• Die DMFT signals are transmitted in the payload stream in 
15 separately labeled data packets. A transmission of this type 
was^ standardized by the IETF in the Request for Comments RFC 
2833 for the RTF (real-time protocol) packet format. 

With in-band transmission of DTMF signals, usually special 
20 hardware resources, for example, designed with DSPs (DSP: 
digital signaling processor) or ASICs (ASIC: Application 
specific integrated circuit) are required in the speech 
dialogue system or the IVR server for the analysis of the DTMF 
signals. With relation to the invention, in-band transmission 
25 of DTMF signals is largely avoided and the use of speech 

dialogue systems or IVR servers without hardware resources for 
the recognition of DTMF signaling is proposed. 

The coding method and the type of exchange of DTMF signals for 
30 an automated information output is usually determined during a 

so called codec negotiation between packet network terminals. 
The first packet network terminal is represented, for example, 
by a network interface device or a media gateway or by a packet 
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based terminal linked directly to the packet network. The 
second packet network terminal is the speech dialogue system. 
In the codec negotiation a codec supported by both terminals 
and by the network is selected from a list of codecs. Usually, 
5 when a codec is selected, by default or by presetting, the type 
of transmission of the DTMF signals is also determined, e.g. 
the selection of the coding method G.711 is linked to in-band 
transmission of the DTMF signals. With relation to the 
invention, two methods will be shown to exclude in-band 
1 0 t r ansmi s s i on : 

In the first method, only out-of -band- signaling or signaling by 
means of specially labeled data packets is permitted in the 
codec negotiation. Coding methods that involve an in-band 
15 signaling of DTMF signals are practically eliminated from the 
list of applicable codecs during the codec negotiation. 

The second method includes an extension of the logic of the 
control device controlling the speech dialogue system. The 

20 control device (e.g. a packet based exchange, a call server, a 
proxy server or a soft switch) is embodied such that said 
control device signals to the remote packet network terminal 
involved in the codec negotiation to use out-of-band-signaling 
of DTMF signals independently of the selected codec. In this 

25 case, as a rule the codec G.711 can also be selected. 

The invention has the advantage that speech dialogue systems 
can be used without complex hardware resources. In principle, 
for services with user interaction or automated information 
30 output, it is then also possible to use so called general 

purpose platforms, that is multi-function computers with open 
interfaces that provide the desired IVR or information output 
resources through their software tools. The provision of 
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dedicated hardware is no longer necessary. 

According to a further embodiment - as a backup so to speak - 
provision is made for the rerouting of the job to a speech 
5 dialogue system with dedicated hardware for those cases where 
the in-band signaling of DTMF signals cannot be achieved with 
the above procedure. That would be the case in the first method 
if no codec supported by the two packet network terminals can 
be identified with out-of-band signaling and there is also no 

10 provision for signaling by means of a packet specifically 
provided for DTMF signals. In this case, according to the 
further embodiment, the service is switched to a speech 
dialogue system with dedicated hardware. With the second 
method, in the course of which, a control device stipulates 

15 out-of-band-signaling for the first packet network terminal 

independently of the codec selected, it is possible to forward 
the call to the speech dialogue system with hardware for DTMF 
signal recognition if the necessary resources or technical 
support for the out-of-band transmission are not available. 

20 

The further embodiment allows a service to be dealt with also 
in cases where it is not possible to carry out a service with 
the speech dialogue system without special hardware. Otherwise, 
service requirements of that type would have to be refused. As 

25 a rule, however, it will be possible to provide the service 
through the speech dialogue system without special hardware. 
Therefore, the provision of one backup speech dialogue system 
will be sufficient for a large number of speech dialogue system 
without special hardware. Alternatively, the speech dialogue 

30 system with special hardware can be assigned only to one speech 
dialogue system without dedicated hardware, whereby the speech 
dialogue system without special hardware is accordingly more 
powerfully sized, i.e. with respect to the available resources 
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it is designed for the processing of a lot more service 
requirements per time unit than the other speech dialogue 
system. 

5 In the following, the subject of the invention will be 

described in more detail below with reference to an embodiment 
using a figure. 

There are two IVR servers, IVRl and IVR2, represented in the 

10 figure, with the first IVR server IVRl having no special 

hardware for the processing ' of DTMF signals, the second IVR 
server, however, does. Subscribers TLN are represented that are 
connected to' a TDM network PSTN/ ISDN. Voice communications of 
the subscriber TLN can be switched via a switching system 

15 switch. The TDM network ISDN/PSTN is connected with an IP 

network IPNet by means of a media gateway GW. Here, this is, 
for example, a core network. Within the IP network IPNet, 
signaling information and payload ND are routed separately. 
Signaling information coming from the TDM network IDSN/PSTN is 

20 transferred via a so called signaling transfer point STP to a 
call server CS2 . Signaling is carried out with the help of 
signals of the SS7 signaling system. Using the MGCP protocol 
(Media gateway Control Protocol) , the call server CS2 exchanges 
signaling information with the gateway GW and the speech 

25 dialogue systems IVRl and IVR2 . The H.248 protocol could also 
be used instead of the MGCP protocol. With the help of the MGCP 
(Media gateway Control Protocol) protocol, the call server CSl 
can also control the speech dialogue system IVRl. 

30 . According to the invention, the selection of a codecs is 

carried out in the course of an interactive dialogue with a 
subscriber TLN as follows: 
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With relation to the codec negotiation, a type of DTMF 
signaling without in-band signaling is required through the 
selection of the coding method, which DTMF signaling occurs in 
the packet network by means of signaling between the IP 
5 terminals and the call server CS2 . This procedure executes in 
such a way that the A-side IP terminal, represented by the 
gateway GW, signals a prioritized list of voice codecs, fax and 
tone capabilities to the call server CS2 during the call set- 
up. In compliance with administrable defaults, the call server 

10 CS2 can delete from the list the voice codecs/ capabilities 

that should not be used in the network, or said call server can 
change the priorities. The modified list is delivered to the B- 
side IP terminal, in this case the speech dialogue system IVRl . 
This system compares the list received via the signaling list 

15 with its own list and eliminates the voice codecs/ capabilities 
that are not contained in both lists. The list thus checked and 
possibly modified is signaled back to the A-side IP terminal 
via the call server CS2, and sets the selection of voice 
codecs/ capabilities that are to be used. 

20 

During the codec negotiation the speech dialogue system IVRl 
only offers the signaling to RFC2833 and compressing voice 
codecs, which do not permit an in-band DTMF signaling and 
therefore inevitably result in a DTMF out-of-band signaling. 

25 That means that in special the usually used, but DTMF 

transparent regression coders (e.g. G.711) are not contained in 
the codec list signaled back by the IVR. The demand for the 
DTMF out-of-band signaling is -made by the call server CS2 by 
means of signaling to the A-side IP terminal. To this end the 

30 call server CS2 has a logic that checks the voice codec 

negotiated by the codec negotiation. If this is a compressing 
voice codec (e.g. G.723), the call server signals the DTMF out- 
of-band transmission to the A-side IP terminal. 
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There is a peripheral device with virtual announcement and/or 
dialogue ports in the call server CS2 . The speech dialogue 
system IVRl and possibly also the speech dialogue system IVR2 
5 are controlled via said peripheral device. This peripheral 

device with virtual announcement and/or dialogue ports converts 
the seizures of its ports by the call server CS2 into seizure 
signaling of the allocated ports leading to the speech dialogue 
systems. This peripheral device also outputs the jobs to play 

10 announcements and dialogues towards the speech dialogue 

systems. Acknowledgements from the speech dialogue systems IVRl 
or IVR2 indicating the end of the annoiincement or containing 
the input of the end user ensue at the assigned peripheral 
device with virtual announcement and/or dialogue port. All the 

15 signaling between the peripheral device responsible for the 
virtual announcement and/or dialogue port and the assigned 
external speech dialogue systems IVRl or IVR2 ensues via the 
signaling protocol MGCP that is used to access the media 
gateway MG. 

20 

For the case that the A- side IP terminal does not support any 
of the voice codecs offered by the IVRl that lead to DTMF out- 
of-band signaling, or RFC2833, the service requirement is 
automatically rerouted or forwarded to an alternative speech 

25 dialogue system IVR2 that also supports the voice codecs with 
in-band DTMF signaling. Since, as a rule, there is only a small 
number of IP terminals that exclusively support voice codecs 
with in-band DTMF signaling, the channel number of the speech 
dialogue system IVR2 can be substantially smaller than that of 

30 the speech dialogue system IVRl, thus achieving an optimization 
of costs for the overall IVR functionality to be made 
available . 
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The codec negotiation that takes place during the call set-up 
is used as a trigger event for the rerouting of the speech 
dialogue system IVRl to the speech dialogue system IVR2 . The 
speech dialogue system IVRl determines on the basis of the 
5 codec negotiation that there is no match between the voice 
codecs of the A-side IP terminal and of the speech dialogue 
system IVRl, and signals a corresponding error (e.g. error code 
543 „Codec Negotiation Error") to the peripheral device with 
virtual announcement and/or dialogue port in the call server. 

10 The peripheral device evaluates this error and, by means of the 
data link control in the call server, initiates thereupon a 
removal of the connection to the speech dialogue system IVRl, 
followed by a setting up of the connection to the speech 
dialogue system IVR2 . The connection to the A-side IP terminal 

15 is maintained during this rerouting procedure. The addresses of 
the speech dialogue systems IVRl and IVR2 are administered in 
the. database of the call server CS2 . 

It is also conceivable that the speech dialogue system with 
20 special hardware IVR2 is controlled by a different call server 
from the call server CS2, e.g. by the call server CSl. In this 
case, the rerouting from the speech dialogue system IVRl to the 
speech dialogue system IVR2 can be achieved by exchanging 
appropriate signaling information between the two call servers 
25 CS2 and CSl e.g. by means of the BICC (Bearer Independent Call 
Control ) protocol . 

An alternative procedure is based on the extension of the logic 
in the call server CS2 to require the DTMF out-of-band 
30 signaling. This extension is that the logic checks whether the 
B-side IP terminal is the speech dialogue system IVRl. In this 
case the speech dialogue system IVRl also offers the non- 
compressing voice codecs, and the call server CS2, 
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independently of the selected voice codec, always signals DTMF 
out-of-band transmission to the A-side IP terminal. 



